THETA BODIES FOR POLYNOMIAL IDEALS 
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Abstract. Inspired by a question of Lovasz, we introduce a hierarchy 
of nested semidefinite relaxations of the convex hull of real solutions to 
an arbitrary polynomial ideal, called theta bodies of the ideal. These re- 
laxations generalize Lovasz's construction of the theta body of a graph. 
We establish a relationship between theta bodies and Lasserre's relax- 
ations for real varieties which allows, in many cases, for theta bodies 
to be expressed as feasible regions of semidefinite programs. Examples 
from combinatorial optimization are given. Lovasz asked to character- 
ize ideals for which the first theta body equals the closure of the convex 
hull of its real variety. We answer this question for vanishing ideals of 
finite point sets via several equivalent characterizations. We also give a 
geometric description of the first theta body for all ideals. 



A central concern in optimization is to understand conv(S'), the convex 
hull of the set of feasible solutions S, to a given problem. In many instances, 
the set of feasible solutions to an optimization problem is the set of real 
solutions to a polynomial system: /i(x) = /2(x) = • • • = / m (x) = 0, where 
/l) • • • j fm € K[x] := R[xi, . . . , x n ]. This set is the real variety, Vr(I), of the 
ideal I in M[x] generated by /i, . . . , f m , and it is often necessary to compute 
or represent conv(V]R(J)) exactly or at least approximately. 

Recall that cl(conv(VR(/))), the closure of conv(VR(/)), is cut out by the 
inequalities /(x) > as / runs over all linear polynomials that are non- 
negative on Vr(/). (Call / € R[x] a linear polynomial if it is affine linear of 
the form / = ao + ^ILi a i x i-) A classical certificate for the non- negativity of 
a polynomial / on Vr(/) is the existence of a sum of squares (sos) polynomial 
Y2j=i tij that is congruent to / mod I (i.e., / — X^=i e -0) written as 
/ = Y2j=i h 2 j mod /. If this is the case, we say that / is sos mod I. Hence 
a natural relaxation of cl(conv(V]R(/))) is the closed convex set: 



Depending on /, (JTJ) may be strictly larger than cl(conv(V]R(I))) since there 
may be polynomials that are non- negative on Vr(I) but not sos mod /. 
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However, in many interesting cases, ([I]) will equal cl(conv(V]R (/))). By- 
bounding the degree of the hj's that appear in the sos representations, and 
gradually increasing this bound, we obtain a hierarchy of relaxations to 
cl(conv(Vig(/))). In [18], Lovasz asked a question that leads to the study of 
this hierarchy. To explain it, we first introduce some definitions. 

Definition 1.1. Let / be a polynomial in R[x], / be an ideal in R[x] with 
real variety V R (i) := {s £ R" : f(s) = V / G /}, and let R[x] fc denote the 
set of polynomials in R[x] of degree at most k. 

(1) The polynomial / is /c-sos mod / if there exists hi, . . . , ht G R[x]^ 
for some t such that / = Ylj=i hj m od /. 

(2) The ideal / is /c-sos if every polynomial that is non-negative on 
Vr(7) is k-sos mod /. If every polynomial of degree at most d that 
is non-negative on Vr(I) is fc-sos mod /, we say that / is (d, fe)-sos. 

Example 1.2. Consider the principal ideal I = (xfx2 — 1) C R[xi,X2]- 
Then coiiv(Vk(/)) = {(si,S2) G R 2 : S2 > 0}, and any linear polynomial 
that is non-negative over Vr(I) is of the form 0x2 + P, where a, (3 > 0. Since 
ax2 + (3 = {y/ax\X2) 2 + (\//3) 2 mod /, / is (1, 2)-sos. Check that X2 is not 
1-sos mod I and so, / is not (1, l)-sos. 

In [18], Lovasz asked the following question. 

Problem 1.3. [181 Problem 8.3] Which ideals in R[x] are (1, \)-sos? How 
about (1, k)-sos? 

The geometry behind the above algebraic question leads to a natural 
hierarchy of relaxations of conv(VR(/)) which we now introduce. The name 
comes from earlier work of Lovasz and will be explained in Section [3l 

Definition 1.4. (1) For a positive integer k, the k-th theta body of 

an ideal / C R[x] is 

THfc(/) := {x G R n : /(x) > for every linear / that is k-sos mod /}. 

(2) An ideal / C R[x] is TH fe -exact if TR k (I) equals cl(conv(V K (/))). 

(3) The theta-rank of / is the smallest k for which / is TH^-exact. 

By definition, TH^I) D TH 2 (I) 3 ■ ■ ■ 3 conv(V H (I)). As seen in Exam- 
ple ll.2l coiiv(Vk(/)) may not be closed while the theta bodies are. Therefore, 
the theta-body sequence of / can converge, if at all, only to cl(conv(ViR(/))). 

A natural question at this point is whether the algebraic notion of an ideal 
being (1, &)-sos is equivalent to the geometric notion of being TH^-exact. 

Lemma 1.5. If an ideal I C R[x] is (1, k)-sos then it is TH^-exact. 

Proof: Let / be (l,fe)-sos and s G R n be not in cl(conv(VR(/))). By 
the separation theorem [TJ Theorem III. 1.3] there exists a linear polynomial 
/, non-negative over cl(conv(V]R(/))), such that /(s) < 0. However, since 
/ is (l,/c)-sos, / is A;-sos mod / and so s G" THfc(I). Hence THfc(/) C 
cl(conv(V R (/))) and so, TH fc (7) equals cl(conv(V R (/))). □ 
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Interestingly, the converse of Lemma 11.51 is false in general. 

Example 1.6. Consider I = (x 2 ) C R[x] with Vr(I) = {0} C E. All linear 
polynomials that are non-negative on Vr(I) are of the form ±a 2 x + b 2 for 
some a, b G R. If b / 0, then (±a 2 x + 6 2 ) = ± 6) 2 mod I. However, ±x 
is not a sum of squares mod /, and hence I is not (1, fc)-sos for any k. On 
the other hand, I is THi-exact since conv(VR(/)) = {0} is cut out by the 
infinitely many linear inequalities ±x + b 2 > as b varies over b ^ 0. 

Definition 1.7. Let I be an ideal in M[x]. Then / is 
(1) radical if it equals its radical ideal 



^7:={/GlR[x] : f™ + g\ + • ■ ■ + g 2 G /, m G N\{0}, 9l , . . . ,g t G R[x]}, 



(3) and zero-dimensional if its complex variety Vc(-f) := {x G C n : 
/(x) = V / G /} is finite. 

Recall that given a set S 1 C R n , its vanishing ideal in M[x] is the ideal 
J(5) := {/ G R[x] : /(s) = V s G S}. Hilbert's Nullstellensatz states that 
for an ideal / C E[x], \/T = X(Vc(J")) and the Real Nullstellensatz states 
that \fl = X(Vr(/)). Hence, I C \/7 C v7, and if / is real radical then it 
is also radical. See for example, [201 Appendix 2], for these notions. 

We will prove in Section [2] that the converse of Lemma 11.51 holds for real 
radical ideals. These ideals occur frequently in applications and for them, 
Problem 11.31 is asking when I is THi-exact, or more generally, TH^-exact. 

Contents of this paper. Recall that a semidefinite program (SDP) is an 
optimization problem in the space of real symmetric matrices of the form: 



where c G M. m and the A/'s are real symmetric matrices. The notation A y 
implies that A is positive semidefinite. SDPs generalize linear programs and 
can be solved efficiently [32J . In Section [2] we prove that under a certain 
technical hypothesis (satisfied by real radical ideals for instance), the theta 
body sequence of an ideal / is a modified version of a hierarchy of relaxations 
for the convex hull of a basic semialgebraic set, due to Lasserre [3 [9]. In 
this case, each theta body is the closure of the projection of a spectrahedron 
(feasible region of a SDP), and an explicit representation is possible using 
the combinatorial moment matrices introduced by Laurent [13]. When / is 
a real radical ideal, we further prove that I is (l,fc)-sos if and only if / is 
THfc-exact which impacts later sections. 

In Section[3]we illustrate the theta body sequence for the maximum stable 
set and maximum cut problems in a graph which are two very well-studied 



V 7 / := {/ G R[x] : f m G /, m G N\{0}}, 
(2) real radical if it equals its real radical ideal 



max 
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problems from combinatorial optimization. The stable set problem moti- 
vated Problem 11.31 We explain this connection in detail in Section [3l 

In Section [4] we solve Problem 11.31 for vanishing ideals of finite point sets 
in W 1 . This situation arises often in applications and is the typical set up 
in combinatorial optimization. Several corollaries follow: If S C W 1 is finite 
and its vanishing ideal I(S) is (1, l)-sos then S is affinely equivalent to a 
subset of {0, l} n and its convex hull can have at most 2 n facets. If S is the 
vertex set of a down-closed 0/1-polytope in W 1 , then I(S) is (1, l)-sos if and 
only if conv(S*) is the stable set polytope of a perfect graph. Families of finite 
sets in growing dimension with (1, l)-sos vanishing ideals are exhibited. 

In Section [5j we give an intrinsic description of the first theta body, 
THi(J), of an arbitrary polynomial ideal I in terms of the convex quadrics 
in /. This leads to non-trivial examples of THi-exact ideals with arbitrarily 
high-dimensional real varieties and reveals the algebraic-geometric structure 
of THi(/). Analogous descriptions for higher theta bodies remain open. 

Remark 1.8. In [10], Lasserre introduced the Schmildgen Bounded Degree 
Representation (S-BDR) and the Putinar-Prestel Bounded Degree Represen- 
tation (PP-BDR) properties of a compact basic semialgebraic set K = {x : 
3i( x ) > 0, . . . ,g m (x) > 0} (where gi S M[x]), defined as follows: 

• K has the S-BDR property if there exists a positive integer k such 
that almost all linear / that are positive over K has a representation 
as / = Ejc[-m] °J9J where aj are sos, gj := H jeJ 9j and the degree 
of crjgj is at most 2k for all J C [m] := {1, . . . , m}. 

• K has the PP-BDR property if there exists a positive integer k such 
that almost all linear / that are positive over K has a representation 
as / = Sj=o o-jgj where aj are sos, go := 1 and the degree of ajgj is 
at most 2k for j = 0, . . . , m. 

Call the smallest such k the S-BDR (respectively, PP-BDR) rank of K. Here 
"almost all" means all except a set of Lebesgue measure zero. Note that the 
PP-BDR property implies that S-BDR property. 

For an ideal I = (/i, . . . , f m ) C M[x], Vk(I) is the, possibly non-compact, 
basic semialgebraic set {x E W 1 : ±/i(x) > 0, ...,±/ m (x) > 0}. When 
Vr(I) is compact, its PP-BDR property is closely related to the (l,A:)-sos 
and THfc-exact properties of /. However, these notions are not exactly com- 
parable since the PP-BDR rank of Vr(I) depends on the choice of generators 
of /, and only the linear polynomials that are positive (as opposed to non- 
negative) over Vr(I). Regardless, note that if Vr(I) has PP-BDR rank k, 
then / has theta-rank at most k. 

Acknowledgments. We thank Monique Laurent and Ting Kei Pong for 
several useful inputs to this paper. We also thank the referees for their many 
constructive comments that helped the organization of the paper. 
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2. Theta Bodies 

In Definition 1 1 .41 we introduced the k-th. theta body of a polynomial ideal 
7 C R[x] and observed that these bodies create a nested sequence of closed 
convex relaxations of conv(VR(7)) with THfc(7) 2 THfc + i(7) 5 conv(VR(7)). 
Lasserre [8] and Parrilo [214123] have independently introduced hierarchies of 
semidefinite relaxations for polynomial optimization over basic semialgebraic 
sets in W 1 using results from real algebraic geometry and the theory of 
moments. We first examine the connection between the theta bodies of an 
ideal 7 and Lasserre's relaxations for conv(V]R(7)). 

2.1. Lasserre's hierarchy and theta bodies. 

Definition 2.1. Let 7 be an ideal in R[x]. The quadratic module of 7 is 

M(I) := {s + 7 : s is sos in R[x]} . 

The £>th truncation of Ai (7) is 

Mk(I) ■= {s + I : s is /c-sos} . 

Both M(I) and Mk(I) are cones in the R- vector space R[x]/7. Let 
(R[x]/7)' denote the set of linear functionals on R[x]/7 and ttj be the projec- 
tion map from (R[x]/7)' to R n defined as 717 (y) = (y(x\ + 7), . . . , y(x n + I)). 
Also let Mk(I)* C (R[x]/7)' denote the dual cone to Mk(I), the set of all 
linear functions on R[x]/7 that are non-negative on A4k(I)- 

Definition 2.2. For y £ (R[x]/7)', let 77^ be the symmetric bilinear form 

H y : R[x]/7 x R[x]/7 — > R 

(f + I,g + I) _ > y(fg + I) 

and 77 yi t be the restriction of H y to the subspace R[x]j/7. 

Recall that a symmetric bilinear form 77 : V x V — > R, where V is a 
R-vector space, is positive semidefinite (written as 77 ^ 0) if H(v, v) > for 
all non-zero elements v £ V. Given a basis B of V, the matrix indexed by 
the elements of B with (pi, 6j)-entry equal to H(bi,bj) is called the matrix 
representation of 77 in the basis B. The form 77 is positive semidefinite if 
and only if its matrix representation in any basis is positive semidefinite. 

Lemma 2.3. Let I C R[x] be an ideal and k a positive integer. Then 
M k (I)* ={y€ (R[x]/7)' : H y>k y 0}. 

Proof: Note that y € Mk(I)* if and only if y(s + 7) > for all fc-sos 
polynomials s. By linearity of y this is equivalent to y(h 2 + 7) > for all 
h € R[x]fc which is the definition of 77^ being positive semidefinite. □ 

The original Lasserre relaxations in |8] approximate conv(S') for a basic 
semialgebraic set 5 = {x£ R n : gi(x) > 0, i = 1, . . . , m} by the sets 

^(y(x 1 ),...,y(x n )) :y€R[x]', y(l) = 1, V \f^s ig ^ >0 
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where Sj are sos, go := 1 and the degree of Sigi is bounded above by some 
fixed positive integer. When there are equations among the gi(x) > 0, 
both Lasserre [9] (for 0/1 point sets) and Laurent [13] (more generally for 
finite varieties) propose doing computations mod the ideal generated by the 
polynomials defining the equations, to increase efficiency. We adopt this 
point of view since in our case, S = Vr(7), is cut out entirely by equations, 
and work with the following definition of a Lasserre relaxation. 

Definition 2.4. Let 7 C R[x] be an ideal, A: be a positive integer, and 3-1 
be the hyperplane of all functions y G (R[x]/7)' such that y(l + 7) = 1. The 
k-th modified Lasserre relaxation Qk(I) of couv(Vr(7)) is 

Q k (i) :=7rj(Mfc(J)*nyi). 

While Mk(I)* H 3i is always closed, Qk(I) might not be (see Exam- 
ple ETB]). We first note that Qk(I) is indeed a relaxation of couv(Vr(7)). 

Lemma 2.5. For an ideal I and a positive integer k, conv(VR(7)) C Qk(I). 

Proof: For s G Vr(7), consider y s G (R[x]/7)' defined as y s (f + 7) := /(s). 
Then y s G .M fc (7)* and y s (l + J) = 1. Therefore, 7r 7 (y s ) = s G Q fc (J), and 
conv(V]a(J)) C Qk(I) since Qk(I) is convex. □ 

Since Q/c+i (-f) C Qk(I), these bodies create a nested sequence of relax- 
ations of conv(V]R(/))) as intended. Our main goal in this section is to 
establish a relationship between Qk(I) and the k-th theta body, TEL. (7), of 
the ideal I (cf. Definition 1 1.4p . We start by noting the following inclusion. 

Proposition 2.6. For an ideal I C R[x] and a positive integer k, c\(Qk(I)) Q 
TH fc (/). 

Proof: Since TH^(I) is closed, it is enough to show that Qk(I) Q TH/ C (J). 
Pick p G Qk{I) and y G Mk{I)* n such that 717(2/) = P- Let / = 
ao + X^iLi a i x i an< ^ / + € TWjfc(I). Then, p G THfc(J) since 

n 

/(p) = /(7T/fo)) = a y(l + I) + 2 a^^ + I) = y(f + I) > 0. 

i=l 

□ 

Theorem 12.81 will prove that if M.f.(I) is closed, we have the equality 
c\(Q k (I)) =TH fc (7). 

Lemma 2.7. Lei / C R[x] be an ideal and k be a positive integer. If 
f G R[x]i is non-negative over Qk{I), then f + I G cl(Mk(I))- 

Proof: Suppose / G R[x]i is non- negative over Qk{I) and / + I G" 
cl(Mk(I))- Then by the separation theorem, there exists y G Mk(I)* such 
that y(f + I) < 0. Since (/ + r + J) 2 = (/ + r) 2 + 7 lies in M k (I) for any 
real number r, y G an d y is linear, we get 

< y((f + r + I) 2 ) = y(f 2 + 7) + 2ry(f + 7) + r 2 y(l + 7) 
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which implies that y(l + /) > since y(f -+- I) 7^ 0. Scaling y such that 
y(l + I) = 1, we have that y G Mk(I)*nyi. This implies that 717(2/) G Qk(I) 
and therefore, by hypothesis, f(ni(y)) > 0. However, since / G M[x]i and y 
is linear, we also get f(nj(y)) = y(f + /) < which is a contradiction. □ 

Theorem 2.8. Let I C R[x] be an ideal. For a positive integer k, if Aik(I) 
is closed, then c\(Qk(I)) = THfc(I). 

Proof: By Proposition 12.61 we need to prove that when is closed, 

THfc(I) C cl(Qfc(I)). Suppose p G" c\{Qk{I))- By the separation theo- 
rem, there exists / G R[x]i non-negative on cl(Qk(I)) with /(p) < 0. By 
Lemma 12771 / + I € .Mfc(I) since Aik(I) is closed by assumption and hence 
/ is /c-sos mod /. Since /(p) < 0, p THfc(I). □ 

An important class of ideals for which is closed is the set of real 

radical ideals which are the focus of Sections [3] and |H We now derive various 
corollaries to Theorem 12.81 that apply to real radical ideals. 

Corollary 2.9. If I C R[x] is a real radical ideal then cl(Qk{I)) = THfc(I). 
Proof: By 125] Prop 2.6], if / is real radical, then A4k(I) is closed. □ 

Lemma 2.10. Let V and W be finite dimensional vector spaces, H C W 
be a cone and A : V — > W be a linear map such that A(V) n mt(H) ^ 0. 
Then (A~ l H)* = A'{H*) where A' is the dual operator to A. In particular, 
A'(H*) is closed in V. 

Proof: This follows from Corollary 3.3.13 in [2] by setting K = V. □ 

Corollary 2.11. Let I be a real radical ideal in R[x] and k be a positive 
integer. If there exists g G R[x]i such that g + I is in the interior of M.k{I) 
(considered as a subset o/M[x] 2 fc// / ), then THfc(/) = Qk{I)- 

Proof: By Corollary 12.91 it suffices to show that Qk(I) is closed. Consider 
R n+1 with coordinates indexed 0,1, ... ,n and the map 

frj : (R[x]/J)' -> R n+1 such that y ^ (y(l + J), 7r/(y)). 

Then Tti(M k (I)* n ^1) = {(l,p) : P G Qfc(I)} and so, Q fc (7) will be closed 
if wi(M k (iy n ^1) = wi(M k (I)*) n {p G R n+1 : po = 1} is closed. Hence, 
it suffices to show that 7r I (Mk(I)*) Q R n+1 is closed. 

Now consider the inclusion map A : R[x]i/i~ — > R[x]2fc/I and let M de- 
note the cone Aik{I) considered as a subset of R[x]2fc//. By assumption, 
A(R[x]i/J) n int(M) = (R[x]i/J) n int(M) / and so by Lemma EEl 
A'(M*) is closed. Let C := {1*, x*, . . . , X*} be the canonical basis of 
(R[x]i/I)'. Then A' (y) = (y(l + 1), y{x\ +/),..., + 1)) with respect to 
C. Now note that if y G Mk(I)* then its restriction y to R[x]2fc/I belongs 
to M* and A'(y) = 7f/(y). Therefore, n I (M k (I)*) C A' (AT). Conversely, if 
y G M* and y is any extension to R[x]/I, then y belongs to Mk{I)*- Since 
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A'{y) = 7rj(y) we get A'(M*) C n I (M k (I)*) and so, A'(M*) = ^(M^I)*) 
is closed. □ 

Let / be an ideal and k a positive integer. In Lemma f 1.5 1 we saw that if I is 
(1, k)-sos (i.e., every linear polynomial that is non-negative on Vr(7) is fc-sos 
mod /), then / is TH^-exact (i.e., THfc(J) = cl(conv(V]g (/)))). Example 11.61 
showed that the reverse implication does not always hold. 

Corollary 2.12. If an ideal I C R[x] is real radical then I is (l,k)-sos if 
and only if I is TH^-exact. 

Proof: If / G M[x]i is non-negative on Vr(I) and I is TH^-exact, then / 
is non-negative on TH/ C (J) and hence on Qk{I)- Therefore, by Lemma 12.71 
/ G cl(A4fe(J)). Suppose now that / is also real radical. Then is 
closed and / + / G .M & (/) , which means that I is (1, fc)-sos. □ 

We close with a brief discussion of ideals for which the theta body sequence 
is guaranteed to converge (finitely or asymptotically) to cl(conv(VR(/))). 

(1) If Vr(I) is finite the results in [11] imply that / is TH^-exact for 
some finite k. If Vc(-0 is finite (/ is zero-dimensional), then k can 
be bounded above by the maximum degree of a linear basis of R[x]/J 
[T5] (see Section LT2]l . However, as in / = (x 2 ), we cannot guarantee 
that I is (1, k)-sos for any k, even when / is zero-dimensional. If / is 
zero-dimensional and radical, then in fact, I is (1, A:)-sos for finite k 
with k < \Vc(I)\ ~ 1 (see [22], [HI Theorem 2.4]). Better bounds are 
often possible as in Remark l4.31 For an ideal / C R[x] we summarize 
the above results in the following table. 

Vc(I) finite >Vr(I) finite 

4 t j==jT 

/ (1 ,k)-sns ' , T THfc-exact 

(2) If Vr(I) is not finite but is compact, Schmiidgen's Positivstellensatz 
|20t Chapter 3] implies that the theta body sequence of / converges 
(at least asymptotically) to cl(conv(VR(/))) (i.e., flfcLiTH^I) = 
cl(conv(V R (/)))). 

(3) If Vr(I) is not compact, then the study of the theta body hierarchy 
becomes harder. Scheiderer [201 Chapter 2] has identified ideals I 
with Vr(/) not necessarily compact, but of dimension at most two, 
for which every / > mod / is sos mod /. In all these cases, the 
theta body sequence of / converges to cl(conv(V]R (/))). 

The results of Schmiidgen and Scheiderer mentioned above fit within a 
general framework in real algebraic geometry that is concerned with when 
an arbitrary / G R[x] that is positive or non-negative over a basic semi- 
algebraic set is sos modulo certain algebraic objects defined by the set. We 
only care about real varieties and whether linear polynomials that are non- 
negative over them are sos mod their ideals. Therefore, often there are ideals 
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I that are TH^-exact or (1, £;)-sos for which there are non- linear polynomials 
/ such that / > mod I but / is not sos mod /. For instance, the proof of 
Theorem 15.41 will show that J n := (X^=i x i ~ -0 * s (1> l)- sos f° r an n > but 
a result of Scheiderer [20} Theorem 2.6.3] implies that when n > 4, there is 
always some non- linear / non- negative on Vk(Ju) that is not sos mod J n . 

2.2. Combinatorial moment matrices. To compute theta bodies we must 
work with the truncated quadratic module M.^{I) which requires computing 
sums of squares in R[x]/7 as described in [23], or dually, using the combi- 
natorial moment matrices introduced by Laurent in [13]. We describe the 
latter viewpoint here as it is more natural for theta bodies. 

Consider a basis B = {/o + I,fx + I,...} for R[x]/7, and define deg(/j + 
I) := minj_j jg /deg /. For a positive integer k, let B\. :={// + /€# : 
deg(/z +!)<&}, and set := (fi + I : fi + 1 G Bk). We may assume that 
the elements of B are indexed in order of increasing degree. Let A(9 +/ ) := 
(a| 9+/ ^) be the vector of coordinates of g + I with respect to B. Note that 
\(9+ I ) h as only finitely many non-zero coordinates. 

Definition 2.13. Let y G R B . Then the combinatorial moment matrix 

Mg(y) is the (possibly infinite) matrix indexed by B whose entry is 

The /c-th-truncated combinatorial moment matrix Afg fe (y) is the finite 
(upper left principal) submatrix of Mg(y) indexed by B^. 

Although only a finite number of the components in \(fifj+r) are non- 
zero, for practical purposes we need to control exactly which indices can be 
non-zero. One way to do this is by choosing B such that if / + I has degree 
k then / + 1 G span(Sfc). This is true for instance if B is the set of standard 
monomials of a term order that respects degree [3]. If B has this property 
then Afg fc (y) only depends on the entries of y indexed by £>2fc- 

Theorem 2.14. For each positive integer k, 

proj RBl {y G R B2fc : M Bfc (y) y 0, y = 1} = fi(Q fc (/)), 

where yo is the first entry of y G R S2fe , projjjBj is the projection onto the 
coordinates indexed by B\, and for p G R n , fi(p) := (/^(p))/^/^!- 

Proof: We may identify y = (y$) G M B2fe with the operator y G (R[x]/J)' 
where + I) = y« if /j + I G 02ft and zero otherwise. Then Mg fc (y) is 
simply the matrix representation of Hg^ in the basis B, since we assumed 
that if deg(/j + I),deg(fj + 1) < k then y(fifj + I) depends only on the 
value of y on £> 2 fc- Therefore, projjgBjjy G R B2fe : Mg fc (y) y 0} equals 

+ I))Bi : 1/ G (K[x]/J)', # 5 , fc b 0}. 
Furthermore, since /, is linear whenever fi + lEB\, 

(y(fi + /)) Bl = (/i(T/(y))) Bl =: fi(7r/(y)) 
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so by Lemma EH proj RBl {y G R B ™ : M Bk {y) h 0, y = 1} = fi(Q fe (/). □ 

Corollary 2.15. Suppose B\ = {1 + I , x± + I , . . . , x n + 1} and denote by 
yo,yi, ■ ■ ■ ,y n the first n + 1 coordinates o/y£ M B2fc ; t/ien 

QkiJ) = {(yi, ■ ■ ■ ,Vn) :y£ ^ with M Bk (y) h 0andy = l}- 

By Corollary 12.151 optimizing a linear function over Qk{I), hence over 
cl(Qfc(-0)> i s an SDP and can be solved efficiently 

Example 2.16. Consider the ideal / = (x 2 X2 — 1) C M[xi,X2] from Exam- 
ple 11.21 for which conv(V]R(/)) = {(si,S2) 6M 2 : «2 > 0} was not closed but 
/ was TH2-exact and (1, 2)-sos. Note that B = \Jk^{ x i+I: X \+I, X i x 2+I} 
is a degree-compatible monomial basis for ]R[xi,X2]// for which 

B& = \^t x 1i x 2i x 1i x 1 x 2-, x 2i X\x\, x\, X2, x l x 2i x t > ^2} + I ■ 
The combinatorial moment matrix Mg 2 (y) for y = (1, yi, . . . , y±±) G M. 4 is 





1 


Xl 


X2 


4 


X1X2 


x 2 


1 


( 1 


in 


yi 


2/3 


2/4 


2/5 \ 


Xl 


yi 


in 


in 


2/6 


1 


2/7 


x 2 


y2 


in 


2/5 


1 


2/7 


2/8 


x\ 


2/3 


2/6 


1 


2/9 


2/1 


2/2 


X\X2 


yA 


1 


117 


ill 


2/2 


2/10 


x 2 


\ y5 




ys 


2/2 


2/io 


2/11 / 



If Mg 2 (y) ^ 0, then the principal minor indexed by Xi and x\X2 implies 
that 2/22/3 > 1 an d so in particular, 2/2 7^ for all y G Q2(I)- However, 
since Q2(-0 5 conv(Vifj(7)) = {(^1,52) G M 2 : s 2 > 0}, it must be that 
Q2(I) = conv(V]R(I)) which shows that Q2(I) is not closed. 

Remark 2.17. Example 12. 161 can be modified to show that Qk{I) may not 
be closed even if Vr(I) is finite. To see this, choose sufficiently many pairs 
of points (±t, 1/t 2 ) on the curve x\x2 = 1 to form a set S such that the 
ideal I(S) has a monomial basis B' in which B'^ equals the B4 from above. 
For instance, S = {(±t,l/t 2 ) : t = 1,...,7} will work. Then Q 2 (1(S)) 
coincides with Q2(I) computed above and so is not a closed set. 

We now show that in the particular case of vanishing ideals of 0/1 points, 
which are real radical ideals, the closure in Theorem l2.8l (THt,(/) = cl(Qk(I))) 
is not needed. Most ideals that occur in combinatorial optimization have 
this form and we will see important examples in Section [3l Remark 12.171 
shows that the closure cannot be removed for arbitrary finite point sets. 

Proposition 2.18. If S is a set of 0/1 points in M. n and I = I(S) then for 
all positive integers k, THfc(J) = Qk{I)- 

Proof: By Corollary 12.111 it is enough to show that there is a linear 
polynomial g G M[x] such that g = flAf^ mod / for a positive definite matrix 
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i+i=fi 



fi, mod /, 



A and some basis of K[x]/J with respect to which was determined. Let B 
be a monomial basis for R[x]// and Bk = {l,Pi, • • • ,Pi} + 1- Let c G IR' be 
the vector with all entries equal to —2, and D G M. lxl be the diagonal matrix 
with all diagonal entries equal to 4. Since xf = x% mod I for i = 1, . . . ,n 
and B is a monomial basis, for any f + 1 £ B, f = f 2 mod /. Therefore, the 
constant 

l + l c* 
c D 

and it is enough to prove that the square matrix on the right is positive 
definite. This follows from the fact that D is positive definite and its Schur 
complement (/ + 1) — c t D~ 1 c = 1 is positive (0 Theorem 7.7.6]). □ 

3. Combinatorial Examples 

An important area of application for the theta body hierarchy constructed 
in Section [2] is combinatorial optimization which is typically concerned with 
optimizing a linear function over a finite set of integer points. In this sec- 
tion, we compute theta bodies for two important problems in combinatorial 
optimization - the maximum stable set problem and the maximum cut prob- 
lem in a graph. We explain the observations about the stable set problem 
which motivated Lovasz to pose Problem 1 1.31 The cut problem is modeled in 
two different ways. The first is a non-standard approach which is described 
fully. For the second, more standard model of the cut problem, theta bodies 
provide a new hierarchy of semidefinite relaxations for the cut polytope that 
is studied in detail in [5]- We outline those results briefly here. A recent 
trend in theoretical computer science has been to study the computational 
complexity of approximating problems in combinatorial optimization via the 
standard hierarchies of convex relaxations to these problems such as those 
in [19] and [HJ [9]. Our theta body approach provides a new mechanism to 
establish such complexity results. 

3.1. The Maximum Stable Set Problem. Let G = {[n],E) be an undi- 
rected graph with vertex set [n] = {1, . . . ,n} and edge set E. A stable set 
in G is a set U C [n] such that for all i,j G U, {i,j} E. The maximum 
stable set problem seeks the stable set of largest cardinality in G, the size 
of which is the stability number of G, denoted as a(G). 

The maximum stable set problem can be modeled as follows. For each 
stable set U C [n], let \ u 6 {0,1}™ be its characteristic vector defined as 
( x U) i = i if i G u and ( X u )i = otherwise. Let S G C {0, l} n be the set of 
characteristic vectors of all stable sets in G. Then STAB(G) := conv(S'G') is 
called the stable set polytope of G and the maximum stable set problem is, 
in theory, the linear program max{^]" =1 Xi : x G STAB(G)} with optimal 
value a(G). However, STAB(G) is not known apriori, and so one resorts to 
relaxations of it over which one can optimize Ya=i Xi - 

In [16], Lovasz introduced, TH(G), a convex relaxation of STAB(G), 
called the theta body of G. The problem max{)]" =1 Xi : x G TH(G)} is 
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a SDP which can be solved to arbitrary precision in polynomial time in the 
size of G. The optimal value of this SDP is called the theta number of G and 
provides an upper bound on a(G). See [U Chapter 9] and [29] for more on 
the stable set problem and TH(G). The body TH(G) was the first example 
of a SDP relaxation of a discrete optimization problem and snowballed the 
use of SDP in combinatorial optimization. See [151 [T8] for surveys. Recall 
that a graph G is perfect if and only if G has no induced odd cycles of length 
at least five or their complements. Lovasz showed that STAB(G) = TH(G) 
if and only if G is perfect. This equality shows that the maximum stable 
set problem can be solved in polynomial time in the size of G when G is 
a perfect graph, and this geometric proof is the only one known for this 
complexity result. 

The theta body TH(G) has many definitions (see [61 Chapter 9]) but the 
one relevant for this paper was observed by Lovasz and appears without 
proof in [17] . Let Iq := (xj — Xj V j G [n], x-iXj V {i,j} £ E) C R[x]. 
Then check that Vr(/gO — Sq and that Iq is both zero-dimensional and 
real radical. Lovasz observed that 

(3) TH(G) = {xER" : /(x) > V linear / that is 1-sos mod I G }. 

By Definition O (1), TH(G) is exactly the first theta body, THi(7 G ), of 
the ideal Iq, and by the above discussion, Iq is THi-exact (i.e., TH\(Iq) = 
STAB(G)) if and only if G is perfect. Lovasz observed that, in fact, Iq is 
(1, l)-sos if and only if G is perfect which motivated Problem ll.3l that asks for 
a characterizations of all (1, l)-sos ideals in K[x]. Lovasz refers to a (1, l)-sos 
ideal as a "perfect ideal. A (1, l)-sos ideal I would have the property that its 
first and simplest theta body, THi(J), coincides with cl(conv(V]R(I))) which 
is a valuable property for linear optimization over conv(V^(/)), especially 
when THi(I) is computationally tractable. 

The theta body hierarchy of the ideal Iq therefore naturally extends the 
theta body of G to a family of nested relaxations of STAB(G). Further, 
the connection between TH(G) and sums of squares polynomials motivated 
Definition 11.41 which extends the construction of TH(G) to a hierarchy of 
relaxations of Vr(/) for any ideal / C R[x]. We now explicity describe the 
k-th theta body of Iq in terms of combinatorial moment matrices. 

For U C [n], let x^ 7 := n«GC/ Xi- From the generators of Iq it is clear 
that if / G M[x], then / = g mod Iq where g is in the M-span of the set 
of monomials {xF : U is a stable set in G}. Check that B := {x^ + I G : 
U stable set in G} is a basis of M[x]/Ig containing x±+Ig, • • • , x n +Ic- 

Therefore, by Corollary 12.151 and Proposition 12.181 we have 

]MH,Me Rls fe |x|B fe | such that 

M m = 1, 

TH fc (/ G ) = {yel n : M m = M {l}d = M {i}{i} = Vl 

M uv > = if U U V is not stable in G 
M uw = M ww , if U U U' = W U W 
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THi(/ G ) 



ye 



In particular, indexing the one element stable sets by the vertices of G, 

3M y 0, M G R("+ 1 )x(™+ 1 ) such that 
M 00 = 1, 

M 0i = M i0 = Ma = Vl V i G [n] 
Mij = V {i,j} G E 

This description of THi(Jg) coincides with the semidefinite description 
of TH(G) (see pH Lemma 2.17] for instance) and so, TH(G) = THi(J G ). 
Corollary 12.121 confirms Lovasz's observation and adds to his other charac- 
terizations of a perfect graph as follows. 

Theorem 3.1. j6j Chapter 9] The following are equivalent for a graph G. 

(1) G is perfect. 

(2) STAB(G) = TH(G). 

(3) TH(G) is a polytope. 

(4) The complement G of G is perfect. 

(5) Iq is (1, l)-sos. 

The usual Lasserre relaxations of the maximum stable set problem are set 
up from the following initial linear programming relaxation of STAB(G): 

FRAC(G) :={x£l" : Xi > V i G [n], 1 - Xi - Xj > V {i,j} G E}. 

Note that S G = FRAC(G) n {0, l} n . The fc-th Lasserre relaxation of 
STAB(G) (see 0, [H]) uses both the ideal {xj - Xi : i G [n]) and the 
inequality system describing FRAC(G), whereas in the theta body formu- 
lation, THfc(/c), there is only the ideal Iq and no inequalities. Despite this 
difference, |12| Lemma 20] proves that the usual Lasserre hierarchy is exactly 
our theta body hierarchy for the stable set problem. This interpretation of 
the Lasserre hierarchy provides new tools to understand these relaxations 
such as establishing the validity of inequalities over them as shown below. 

Since no monomial in the basis B of R[x]/1g has degree larger than a(G), 
for any G, Iq is (1, a(G))-sos and STAB(G) = TH a rQ\ (Iq) . However, for 
many non-perfect graphs the theta-rank of Iq can be a lot smaller than a{G). 
For instance if G is a (2k + l)-cycle, then a(G) = k while Proposition 13.31 
below shows that the theta-rank of Iq is two. 

Theorem 3.2. [281 Corollary 65.12a] If G = ([n],E) is an odd cycle with 
n > 5, then STAB(G) is determined by the following inequalities: 

Xi > V i G [n], 1 — Xi > V cliques K in G, a(G) — Xi > 0. 

Proposition 3.3. If G is an odd cycle with at least five vertices, then Iq is 
(l,2)-sos and therefore, TH2-exact. 

Proof: Let n = 2k+l and G be an n-cycle. Then Iq = (xf—Xi, XiXi + \ V i G 
[n]) where x n+ \ = x\. Therefore, (1 — Xi) 2 = 1 — Xi and (1 — Xi — Xi + i) 2 = 
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1 — Xi — Xi + i mod Iq. This implies that, mod Iq, 

Pi ■= ({~L-Xl)(l-X 2 i-X 2 i + l)) 2 =Pi = l-X 1 -X 2 i-X2i + l+X 1 X2i+X 1 X2i+l- 

Summing over % = l,..,k, we get 

k 2k+l 2k 

= k — kxi — Xi + x\Xi mod Iq 

i=l i=2 i=3 

since 2:1X2 and xix 2 k+i he in Iq. Define gi := x\{l — x 2 i+\ — x 2 i +2 ). Then 
gf — gi E Iq and mod Iq we get that 

fc-l 2k k k-1 2k+l 

yZdf = ( k ~ 1 ) X ^ ~ ^xiXi, which implies ^Pi + ^gf = k - y~] Xj. 

i=l i=3 i=l i=l i=l 

To prove that Iq is (1, 2)-sos it suffices to show that the left hand sides of 
the inequalities in the description of STAB(G) in Theorem 13.21 are 2-sos mod 
Iq since by Farkas Lemma |27j . all other linear inequalities that are non- 
negative over Sq are non-negative real combinations of a set of inequalities 
defining STAB(G). Clearly, Xi = xf mod Iq for all i G [n] and one can 
check that for each clique K, (1 — ^eif Xi ) = ( 1 "E 4e x^) 2 m °d la- The 
previous paragraph shows that k — X^Si^ 1 x i is a l so 2-sos mod Iq. □ 

An induced odd cycle C 2 k+i in G, yields the well-known odd cycle in- 
equality J2i£C 2k +i Xi — a (^2fc+i) = k that is satisfied by Sq [5J Chapter 
9]. Proposition 13.31 implies that for any graph G, TH 2 (Iq) satisfies all odd 
cycle inequalities from G since every stable set U in G restricts to a stable 
set in an induced odd cycle in G. This general result can also be proved 
using results from [19] and [12]. The direct arguments used in the proof 
of Proposition 13.31 are examples of the algebraic inference rules outlined by 
Lovasz in [17] . Similarly, one can also show that other well-known classes of 
inequalities such as the odd antihole and odd wheel inequalities [6J Chapter 
9] are also valid for TH^/g)- Schoenebeck [26] has recently shown that 
there is no constant k such that STAB(G) = TH^/g) for all graphs G (as 
expected, unless P=NP). However, no explicit family of graphs that exhibit 
this behaviour is known. 

3.2. Cuts in graphs. Given an undirected connected graph G = ([n],E) 
and a partition of its vertex set [n] into two parts V\ and V 2 , the set of edges 
{i, j} £ E such that exactly one of % or j is in V\ and the other in V 2 is the cut 
in G induced by the partition (Vi,V 2 ). The cuts in G are in bijection with 
the 2"" 1 distinct partitions of [n] into two sets. The maximum cut problem 
in G seeks the cut in G of largest cardinality. This problem is NP-hard and 
has received a great deal of attention in the literature. A celebrated result 
in this area is an approximation algorithm for the max cut problem, due to 
Goemans and Williamson [4], that guarantees a cut of size at least 0.878 of 
the optimal cut. It relies on a simple SDP relaxation of the problem. 
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We first study a non-standard model of the max cut problem. Let 

SG := {x F ■ F C E is contained in a cut of G} C {0, 1} E . 

Then the weighted max cut problem with non-negative weights w e on the 
edges e G E is max {^ee-B "'^e : x ^ -S'G}, and the vanishing ideal 

T(SG) = (Xg — x e , x T : e £ E, T odd cycle in G). 

A basis of R[x]/T(5G) is 

5 = {x 17 + I(SG) : U C does not contain an odd cycle in G} 

and 1 + X(SG),x e + X(SG) (Ve G £) lie in 5. Therefore, 

]M^fl,Me Rls fe |x|B fe | such that 
M m = 1, 

THfe (X(5G)) = <J y G R E : M 0{i} = M {i}0 = M {i}{i} = ^ 

Mf/c/' =0ifUUU' has an odd cycle 
Muu> = M ww , if [/ U U' = W U W 



In particular, 
THi (T(5G)) = 



3MH,Me M (| J E|+i)x(| J E|+i) guch that 

ye» B : M 00 = 1, 

M 0e = M e0 = M ee = y e V e G f? 



Note that for any graph G, THi (T(SG)) is the unit cube in M. E which 
may not be equal to conv(S'G). This stands in contrast to the case of stable 
sets for which THi(Ig) is a polytope if and only if THi(Ig) = STAB(G). 

Proposition 3.4. The ideal I {SG) is THi-exact if and only if G is a bi- 
partite graph. 

Proof: This follows immediately from the description of THi (2(SG)) and 
from the fact that G is bipartite if and only if it has no odd cycles. □ 

Since the maximum degree of a monomial in B is the size of the max cut 
in G, the theta-rank of I{SG) is bounded from above by the size of the max 
cut in G. 

Proposition 3.5. There is no constant k such that I(SG) is TH^- exact for 
all graphs G. 

Proof: Let G be a (2k + l)-cycle. Then TR k (l(SG)) + conv(SG) since 
the linear constraint imposed by the cycle in the definition of THfc(X(5G)) 
will not appear in theta bodies of index k or less. □ 

The theta bodies of a second, more standard, formulation of the weighted 
max cut problem are studied in [5]. In this setup, each cut G in G = ([n], E) 
is recorded by its cut vector \ C £ {ilj^ with xfijj = 1 if {hj} G and 
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= — ^ ^ {?">j} ^ Let E n denote the edge set of the complete graph 
K n , and tte be the projection from W En to M E . The cut polytope of G is 

CUT(G) := conv{x C : C is a cut in G} C E B = 7r fi (CUT(ir„)), 

and the weighted max cut problem, for weights w e £ M. (V e £ i?) becomes 



max 



Jl^^l-^) :xeCUT(G)l 

I ee£ J 



In [5], the vanishing ideal IG of the cut vectors {x^ : C is a cut in G} is 
described and a combinatorial basis B for ME/IG is identified. Using these, 
the fc-th theta body, TH^/G), of IG can be described as: 

3M >r 0, M £ m\Bk\x\B k \ suc h that 
y£R E : M u = 1 

M FuF2 = M F . i)Fi if FiAF 2 AF 3 AF 4 is a cycle in G 

These theta bodies provide a new canonical set of SDP relaxations for 
CUT(G) that exploits the structure of G directly. It is also shown in [5] 
that IG is THi-exact if and only if G has no i^5-minor and no induced cycle 
of length at least five which answers Problem 8.4 posed by Lovasz in [18j . 

Remark 3.6. We remark that the stable set problem and the first for- 
mulation of the max cut problem discussed above are special cases of the 
following general setup. Let A be an abstract simplicial complex (or in- 
dependence system) with vertex set [n] recorded as a collection of sub- 
sets of [n], called the faces of A. The Stanley -Reisner ideal of A is the 
ideal J a generated by the squarefree monomials x^x^ ■ ■ ■ xi k such that 
{ii, 12, ■ ■ ■ , ik} Q [n] is not a face of A. If /a '■= J A + (xf — X{ : i £ [n]), 
then Vr(Ia) = {s £ {0, l} n : support(s) £ A}. For T C [n], recall that 
x T := HieT 3 "*- Then B := {x 1 * : T £ A} + /a is a basis for R[x]//a 
containing 1 + Ia, x\ + Ia, ■ ■ ■ ,x n + /a- Therefore, by Corollary 12.151 and 
Proposition 12.181 the fc-th theta body of /a is 

TH fc (/ A ) = proj, r ;// {y £ R B2k : M Bk (y) y 0, y = 1}. 

Since B is in bijection with the faces of A, and xf — xi £ Ia for all i £ [n], 
the theta body can be written explicitly as follows: 

3M h 0, M £ Rl^lxl^lsuch that 
M m = 1, 

TH fc (/ A ) = { y £ M n : M m = M {i}d = M {i}{i} = Vl 

Muu> = if U U U' $ A 
M vu > = M ww , if U U U' = W U W 

If the dimension of A is d — 1 (i.e., the largest faces in A have size d), then 
Ia is (1, d)-sos and therefore, TH^-exact since all elements of B have degree 
at most d. However, the theta-rank of Ia could be much less than d. 
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4. Vanishing ideals of finite sets of points 

Recall that when S C R™ is finite, its vanishing ideal I(S) is zero- 
dimensional and real radical. 

Definition 4.1. We say that a finite set S C R n is exact if its vanishing 
ideal 1(S) C R[x] is THi-exact. 

We now answer Lovasz's question (Problem 11 .3 j) for vanishing ideals of 
finite point sets in R n . 

Theorem 4.2. For a finite set S C R n , the following are equivalent. 

(1) S is exact. 

(2) I(S) is (l,l)-sofl. 

(3) There is a linear inequality description ofconv(S), of the form gi(x) > 
(i = 1, . . . , m), where each gi is 1-sos mod T{S). 

(4) There is a linear inequality description ofconv(S), of the form gt{x) > 
(i = l,...,m), where each gi is an idempotent mod I(S), i.e., 
gf - Qi € T{S) fori = l,...,m. 

(5) There is a linear inequality description o/conv(5), of the form gi(x) > 
(i = 1, . . . , m), where each gi takes at most two different values in 
S, i.e., for each i, S is contained in the union of the hyperplane 
gi(x) = and one unique parallel translate of it. 

Proof: Since I(S) is real radical, by Corollary 12.121 (1) ^ (2). 

The implication (2) (3) follows from the fact that conv(S') has a finite 
linear inequality description, since S is finite. The implication (3) =>■ (2) 
follows from Farkas lemma, which implies that any valid inequality on S is 
a non-negative real combination of the linear inequalities gi(x) > 0. 

Suppose (3) holds and conv(S) is a full-dimensional polytope. Let F 
be a facet of conv(S'), and g(x) > its defining inequality in the given 
description of conv(S). Then g(x) is 1-sos mod T(S) if and only if there are 
linear polynomials h±, . . . ,hi £ R[x] such that g = h\ + • • ■ + hf mod T(S). 
In particular, since g(x) = on the vertices of F, and all the hf are non- 
negative, each hi must be zero on all the vertices of F. Hence, since the h^s 
are linear, they must vanish on the affine span of F which is the hyperplane 
defined by g(x) = 0. Thus each hi must be a multiple of g and g = ag 2 
mod T(S) for some a > 0. We may assume that a = 1 by replacing p(x) 
by </(x) := ag(x). If conv(<S) is not full-dimensional, then since mod I(S), 
all linear polynomials can be assumed to define hyperplanes whose normal 
vectors are parallel to the affine span of S, the proof still holds. Therefore, 
(3) implies (4). Conversely, since if for a linear polynomial g, g = g 2 mod 
I(S), then g is 1-sos mod T(S), (4) implies (3). 

The equivalence (4) 44> (5) follows since g = g 2 mod T(S) if and only if 
j(b)(1- S (s))=0Vs€S. □ 

Recall from the discussion at the end of Section 2.1 that by results of 
Parrilo, if I C R[x] is a zero-dimensional radical ideal, then the theta-rank 



18 JOAO GOUVEIA, PABLO A. PARRILO, AND REKHA R. THOMAS 



of / is at most |Vc(-0| ~~ 1- Better upper bounds can be derived using the 
following extension of Parrilo's theorem. 

Remark 4.3. Suppose S C R n is a finite set such that each facet F of 
conv(iS') has a facet defining inequality hp(x) > where hp takes at most 
t + 1 values on S, then I(S) is TH^-exact: In this case, it is easy to construct 
a degree t intepolator g for the values of y/hp on S, and we have hp = g 2 
mod 1{S). The result then follows from Farkas Lemma. 

Remark 4.4. The theta-rank of I(S) could be much smaller than the up- 
per bound in Remark 14.31 Consider a (2t + l)-cycle G and the set Sq of 
characteristic vectors of its stable sets. Proposition 13.31 shows that Z{Sg) 
is TH2-exact. However, we need t + 1 translates of the facet cut out by 
Ya=i X{ = t to cover Sg- 

In the rest of this section we derive various consequences of Theorem 14.21 
Finite point sets with property (5) in Theorem 14.21 have been studied in 
various contexts. In particular, Corollaries 14.51 [4791 and 14. 1 II below were also 
observed independently by Greg Kuperberg, Raman Sanyal, Axel Werner 
and Giinter Ziegler (personal communication). In their work, conv(5) is 
called a 2-level polytope when property (5) in Theorem 14.21 holds. 

If S is a finite subset of Z n and C is the smallest lattice in Z n contain- 
ing S, then the lattice polytope conv(S) is said to be compressed if every 
reverse lexicographic triangulation of the lattice points in conv(S') is unimod- 
ular with respect to C Compressed polytopes were introduced by Stanley 
|30| . Corollary 14.51 (4) and Theorem 2.4 in [31] (see also the references after 
Theorem 2.4 in [31] for earlier citations of part or unpublished versions of 
this result), imply that a finite set S C W 1 is exact if and only if conv(S') is 
affinely equivalent to a compressed polytope. 

Corollary 4.5. Let S, S' C R n be exact sets. Then 

(1) all points of S are vertices of conv(5) ; 

(2) the set of vertices of any face of conv(5) is again exact, 

(3) the product S x S' is exact, and 

(4) conv(S') is affinely equivalent to a 0/1 polytope. 

Proof: The first three properties follow from Theorem 14.21 (5). If the 
dimension of conv(5) is d (< n), then conv(S) has at least d non-parallel 
facets. If a-x > b cuts out a facet in this collection, then conv(S') is supported 
by both {x G M n : a • x = 6} and a parallel translate of it. Taking these 
two parallel hyperplanes from each of the d facets gives a parallelepiped. By 
Theorem 14. 2\ S is contained in the vertices of this parallelepiped intersected 
with the affine hull of S. This proves (4). □ 

By Corollary 14.51 (4), it essentially suffices to look at subsets of {0,1}" 
to obtain all exact finite varieties in M. n . In M 2 , the set of vertices of any 
0/1-polytope verify this property. In M 3 there are eight full-dimensional 
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Figure 1. Convex hulls of exact 0/1 point sets in M 3 . 




Figure 2. Convex hulls of non-exact 0/1 point sets in M 3 . 

0/1-polytopes up to affine equivalence. In Figures Q] and [2] the convex hulls 
of the exact and non-exact 0/1 configurations in ]R 3 are shown. 

Example 4.6. The vertices of the following 0/1-polytopes in M n are exact 
for every n: (1) hypercubes, (2) (regular) cross polytopes, (3) hypersim- 
plices (includes simplices), (4) joins of 2-level polytopes, and (5) stable set 
polytopes of perfect graphs on n vertices. 

Theorem 4.7. If S is a finite exact point set then conv(5) has at most 2 d 
facets and vertices, where d = dimconv(5). Both bounds are sharp. 

Proof: The bound on the number of vertices is immediate by Corollary 14.51 
(4) and is achieved by [0, l] d . 

For a polytope P with an exact vertex set S, define a face pair to be an 
unordered pair (F± , F^ ) of proper faces of P such that SCIjUi^ and Fi 
and F% lie in parallel hyperplanes, or equivalently, there exists a linear form 
^Fi,F2 ( x ) such that hF lt F 2 (Fi) = and h-Fx^i^) — 1- We will show that if 
dim P = d then P has at most 2 d — 1 face pairs and 2 d facets. 

If d = 1, then an exact S consists of two distinct points and P has two 
facets and one face pair as desired. Assume the result holds for {d — 1)- 
polytopes with exact vertex sets and consider a d-polytope P with exact 
vertex set S. Let F be a facet of P which by Theorem 14.21 is in a face 
pair (F, F') of P. Since exactness does not depend on the affine embedding, 
we may assume that P is full-dimensional and that F spans the hyperplane 
{x : Xd = 0}, while F 1 lies in {x : x^ = 1}. By Corollary 14.51 F satisfies 
the induction hypothesis and so has at most (2 d_1 — 1) face pairs. Any face 
pair of P besides (F, F') induces a face pair of F by intersection with F, and 
every facet of P is in a face pair of P since S is exact. The plan is to count 
how many face pairs of P induce the same face pair of F and the number of 
facets they contain. 

Fix a face pair [F\, F2) of F, with associated linear form h,Fi,F 2 depending 
only on xi, . . . , x^-i- Suppose (Fi, F2) is induced by a face pair of P with 
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associated linear form H(x). Since H and /ifi,f 2 agree on every vertex of 
F, a facet of P, H(x) = hF 1 .F 2 (xi, ■ ■ ■ , Xd-i) + cxd for some constant c. 

If hp 1: F 2 {xi, . . . ,Xd-i) takes the same value v on all of F' , then H(F') = 
v + c = or 1 which implies that c = — v or c = 1 — v. The two possibilities 
lead to the face pairs (conv(Fi U F'), F 2 ) and (conv(F 2 U F'), Fi) of P. Each 
such pair contains at most one facet of P. 

If hp lt F 2 (xi, . . . ,Xd-\) takes more than one value on the vertices of F', 
then these values must be v and v + 1 for some v since H takes values and 
1 on the vertices of F' . In that case, c = — v, so H is unique and we get at 
most one face pair of P inducing (i 7 !,^). This pair will contain at most 
two facets of P. 

Since there are at most 2 d ~ 1 — 1 face pairs in F, they give us at most 
2(2 d ~ 1 — 1) face pairs and facets of P. Since we have not counted (F, F') as 
a face pair of P, and F and F' as possible facets of P, we get the desired 
result. The bound on the number of facets is attained by cross-polytopes. 

□ 

Remark 4.8. Giinter Ziegler has pointed out that our proof of Theorem 14, 71 
can be refined to yield that P (as used above) has 2 d — 1 face pairs if and only 
if it is a simplex and 2 d facets if and only if it is a regular cross-polytope. 

Recall that Problem ll.3l was inspired by perfect graphs. Theorem 02] adds 
to the characterizations of a perfect graph (c.f. Theorem 13. ip as follows. 

Corollary 4.9. For a graph G, let Sg denote the set of characteristic vectors 
of stable sets in G. Then the following are equivalent. 

(1) The graph G is perfect. 

(2) The stable set polytope, STAB(G), is a 2 -lev el polytope. 

A polytope P in M> is said to be down-closed if for all v G P and 
v' G M™q such that v[ < t?j for i = l,...,n, v' G P. For a graph G, 
STAB(G) is a down-closed 0/1-polytope, and G is perfect if and only if the 
vertex set of STAB(G) is exact. We now prove that all down-closed 0/1- 
polytopes with exact vertex sets are stable set polytopes of perfect graphs. 

Theorem 4.10. Let P C R n be a down-closed 0/1-polytope and S be its set 

of vertices. Then S is exact if and only if all facets of P are either defined 
by non-negativity constraints on the variables or by an inequality of the form 

Siei" x i — ^ f or some ^ ^= M • 

Proof: If P is not full-dimensional then since it is down-closed, it must be 
contained in a coordinate hyperplane X\ = and the arguments below can 
be repeated in this lower-dimensional space. So we may assume that P is 
n-dimensional. Then since P is down-closed, S contains {0, ei, . . . , e n }. 

If all facets of P are of the stated form, using that S C {0, l} n , it is 
straight forward to check that S is exact. 

Now assume that S is exact and g(x) > is a facet inequality of P that 
is not a non-negativity constraint. Then g(x) := c — Yli=i aiXi — ^ ^ or some 
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integers c, 01, . . . ,a n with c / 0. Since G S and S is exact, we get that 
g(s) equals or c for all s G 5. Therefore, for all i, g{ei) = c — a% equals 
or c, so a\ is either or c. Dividing through by c, we get that the facet 
inequality <?(x) > is of the form ^ ig j x% < 1 for some / C [n]. □ 

Corollary 4.11. Let P C R n 6e a full- dimensional down-closed 0/1-polytope 
and S be its vertex set. Then S is exact if and only if P is the stable set 
polytope of a perfect graph. 

Proof: By Corollary 14.91 we only need to prove the "only-if" direction. 
Suppose S is exact. Then by Theorem 14.101 all facet inequalities of P are 
either of the form X{ > for some i G [n] or Yliei x i — 1 f° r some ^ != M- 
Define the graph G = ([n],2£) where {i, j} G E if and only if C J for 

some / that indexes a facet inequality of P. 

We prove that P = STAB(G) and that G is perfect. Let K C [n] such 
that its characteristic vector \ K £ 5 1 - If there exists i,j G such that 
i,j G I for some I that indexes a facet inequality of P, then 1 — Y2%ei x i 
takes three different values when evaluated at the points 0, e,;, x K i n 5* which 
contradicts that S is exact. Therefore, K is a stable set of G and P C 
STAB(G). If A' C [n] is a stable set of G then, by construction, for every I 
indexing a facet inequality of P, \ K nes on either Yliel Xi = ^ or Sie/ = 
0. Therefore \ K ^ P and STAB(G) C P. Since all facet inequalities of 
STAB(G) are either non- negativities or clique inequalities, G is perfect by 
Theorem 9.2.4 hi.]. □ 



5. Arbitrary THi-exact Ideals 

In this last section we describe THi(/) for an arbitrary (not necessarily 
real radical or zero-dimensional) ideal I C R[x]. The main structural result 
is Theorem 1 5 . 41 which allows the construction of non-trivial high-dimensional 
THi-exact ideals as in Example 15.51 

In this study, the convex quadrics in R[x] play a particularly important 
role. These are precisely the polynomials of degree two that can be written as 
-F(x) = x*yLx + b*x+c, where A ^ is an n x n positive semidefinite matrix, 
b G M. n and c£l. Note that every sum of squares of linear polynomials in 
R[x] is a convex quadric. 

Lemma 5.1. For I C R[x], THi(7) 7^ M n if and only if there exists some 
convex quadric F G I. 

Proof: If THi(J) 7^ M n , there exists a degree one polynomial / that is 
strictly positive on THi(J), hence 1-sos modulo /. Then /(x) = g(x) mod 
I for some 1-sos g(x) 7^ and g(x) — /(x) G I is a convex quadric. 

Conversely, suppose x*Ax+b*x-|-c G I with ^4 y 0. Then for any d G M n , 

(x+d) 4 ^(x+d) = x*Ax+2d*^x+d i ^d = (2d i ^-b')x+d i ^d-c mod I. 
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Therefore, since (x + d) l A(x + d) is a sum of squares of linear polynomials, 
the linear polynomial (2d* A — b*)x + d'^4d — c is 1-sos mod I and THi(J) 
must satisfy it. Since d can be chosen so that (2d t A — b') 7^ 0, THi(I) is 
not trivial. □ 

Lemma 5.2. For an ideal I C R[x], THi(J) = f| THi((F)), where F varies 
over all convex quadrics in I . 

Proof: If F E I then (F) C I. Also, if / is linear and 1-sos mod (F) then 
it is also 1-sos mod /. Therefore, THi(J) C THi((i ? )). 

To prove the reverse inclusion, we need to show that if / is a linear polyno- 
mial that is nonnegative on THi(/), it is also nonnegative on f]p e j THi((i ? )), 
where F is a convex quadric. It suffices to show that whenever / is linear 
and 1-sos mod /, then there is a convex quadric F £ I such that /(x) > 
is valid for THi((F)), or equivalently that / is 1-sos mod (F). Since / is 
1-sos mod /, there is a sum of squares of linear polynomials <?(x) such that 
/(x) = </(x) mod /. But g is a convex quadric, hence so is p(x) — /(x). Thus 
/ is 1-sos mod the ideal (<?(x) — /(x)) and we can take F(x) = g(x.) — /(x). 
□ 

Lemma 5.3. // F(x) = x'Ax + b'x + c with A y 0, then THi((F)) = 
conv(V M (F)). 

Proof: We know that conv(V]R(i ? )) C THi((i ? )) and, since F is convex, 
conv(V K (i 7 )) = {x e R n : F(x) < 0}. Thus, if for every x E Vr(F) 
gradF(x) 7^ 0, then conv(V]R(F)) is supported by the tangent hyperplanes 
to Vr(.F). In this case, to show that THi((F)) C conv(ViR(F)), it suffices 
to prove that the defining (linear) polynomials of all tangent hyperplanes 
to Vir(-F) are 1-sos mod (F). The proof of the "if" direction of Lemma 15.11 
shows that it would suffice to prove that a tangent hyperplane to Vr(-F) has 
the form (2d 1 A - b*)x + d*^4d - c = 0, for some d G W 1 . The tangent at 
xo E Vr(-F) has equation = (2Axq + b)*(x — xo) which can be rewritten 
as 

= (2x < A + b*)x - 2xf,ylxo - b*x = (2x* A + b*)x - x* ylxo + c, 

and so setting d = — xo gives the result. 

Suppose there is an xo such that F(xq) = and gradF(xo) = 0. By trans- 
lation we may assume that xo = 0, hence, c = and b = 0. Therefore F = 
xMx = where the hi are linear. Since Vr((F)) = Vr((/ii, . . . , h m )) it 

is enough to prove that all inequalities ±hi > are valid for THi((i 7 )). For 
any e > we have 

(±hi + ef + = F ± 2ehi + e 2 = 2e(±h t + e/2) mod (F), 

so ±hi + e/2 is 1-sos mod (F) for all I and all e > 0. This implies that all 
the inequalities ±/i/ + e/2 > are valid for THi((i ? )), therefore so are the 
inequalities ±hi > 0. □ 
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Theorem 5.4. Let I C M[x] be any ideal, then 

THi(/)= p| conv(V K (F))= f] {x G R n : F(x) < 0}. 



F convex quadric F conuea: quadric 



Proof: Immediate from Lemma 15.21 and Lemma 15.31 □ 

Example 5.5. Theorem 15.41 shows that some non-principal ideals such as 
I = (x 2 - z,y 2 -z) C R[x,y,z] are THi-exact. Since V R (T) = {{±t,±t,t 2 ) : 
t G M}, fixing the third coordinate we get the four points (x,y,t 2 ) where 
| a; | = \y\ = \t\ which implies that 

conv(V K (/)) 5 {(x,y,t 2 ) : \x\ < t, \y\ <t,t> 0}. 

It is easy to see that the right hand side is equal to {(x, y, z) : x 2 < z, y 2 < 
z} which is exactly conv(V]R(x 2 — z)) f] conv(VR(y 2 — z)) and so contains 
THi(I) which contains conv(V]R(/)). So all inclusions must be equalities 
and I is THi -exact. This kind of reasoning allows us to construct non- 
trivial examples of THi-exact ideals with high-dimensional varieties. 

Example 5.6. Consider the set S = {(0, 0), (1, 0), (0, 1), (2, 2)}. Then the 
family of all quadratic curves in I(S) is 

a(x 2 -x)+b(y 2 -y)-{^-^-)xy = (x,y) 



X }—ax—by. 

y J y 

Since the case where both a and b are zero is trivial, we may normalize by 
setting a + b = 1 and get the matrix in the quadratic to be 

A -1/4 
-1/4 1-A 

with A > 0. This matrix is positive semidefinite if and only if A(l — A) — 
1/16 > 0, or equivalently, if and only if A G [1/2 - y/3/4, 1/2 + >/3/4]. 
This means that (x,y) £ THi(X(5)) if and only if, for all such A, 

A(x 2 - x) + (1 - X)(y 2 -y)- l -xy < 0. 

Since the right-hand-side does not depend on A, and the left-hand-side is 
a convex combination of x 2 — x and y 2 — y, the inequality holds for every 
A G [1/2 - V3/4, 1/2 + y/3/4] if and only if it holds at the end points of the 
interval. Equivalently, if and only if 

y) ~ ^xy < 0, 



and 




. 1 
y) ~ ~xy < 0. 



But this is just the intersection of the convex hull of the two curves obtained 
by turning the inequalities into equalities. Figure [3] shows this intersection. 
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